Semantic Video Trailers
Authors
Abstract
Query-based video summarization is the task of creating a brief visual trailer, which captures the parts of the video (or a collection of videos) that are most relevant to the user-issued query. In this paper, we propose an unsupervised label propagation approach for this task. Our approach effectively captures the multimodal semantics of queries and videos using state-of-the-art deep neural networks and creates a summary that is both semantically coherent and visually attractive. We describe the theoretical framework of our graph-based approach and empirically evaluate its effectiveness in creating relevant and attractive trailers. Finally, we showcase example video trailers generated by our system.
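The abstract does not spell out the graph construction or the propagation rule, so the following is only a minimal sketch of generic graph-based label propagation, not the authors' method. It assumes the query and each video segment have already been embedded as vectors (toy 2-D vectors stand in for deep-network features here), treats the query as the single labeled seed node, and spreads its relevance over a cosine-similarity graph. All function names and the parameters `alpha` and `iterations` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def propagate_relevance(query_vec, segment_vecs, alpha=0.8, iterations=50):
    """Spread relevance from the query node to segment nodes.

    Assumption: node 0 is the query (the only seed); the remaining
    nodes are video segments. Returns one score per segment.
    """
    nodes = [query_vec] + list(segment_vecs)
    n = len(nodes)

    # Affinity matrix: non-negative cosine similarity, no self-loops.
    weights = [
        [max(cosine(nodes[i], nodes[j]), 0.0) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]

    # Row-normalize so each node averages over its neighbours.
    transition = []
    for row in weights:
        total = sum(row)
        transition.append([w / total if total else 0.0 for w in row])

    seed = [1.0] + [0.0] * (n - 1)
    scores = seed[:]
    for _ in range(iterations):
        # Mix propagated scores with the original seed labels.
        scores = [
            alpha * sum(transition[i][j] * scores[j] for j in range(n))
            + (1 - alpha) * seed[i]
            for i in range(n)
        ]
    return scores[1:]


def select_trailer(scores, k):
    """Pick the k highest-scoring segments, returned in temporal order."""
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return sorted(top)


# Toy data: segments 0 and 2 point in roughly the same direction as the query.
query = [1.0, 0.0]
segments = [[0.9, 0.1], [0.0, 1.0], [0.8, 0.3], [0.1, 0.9]]
scores = propagate_relevance(query, segments)
trailer = select_trailer(scores, k=2)
```

In this sketch a segment can gain relevance indirectly, through similarity to other relevant segments, which is the usual motivation for propagating over a graph rather than ranking by direct query similarity alone. How the paper scores visual attractiveness is not described in the abstract and is not modeled here.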
Related resources
Semantically controlled content-based retrieval of video sequences
In this paper, we present a technique for automatic classification of movies based on their content. This technique analyses shot duration and motion energy of movie trailers to characterize them as Action/Character movies. This approach is then combined with a features-based technique for content-based retrieval of video. Experiments indicate a high retrieval accuracy (greater than 96%) togeth...
Trailer Generation via a Point Process-Based Visual Attractiveness Model
Producing attractive trailers for videos needs human expertise and creativity, and hence is challenging and costly. Different from video summarization that focuses on capturing storylines or important scenes, trailer generation aims at producing trailers that are attractive so that viewers will be eager to watch the original video. In this work, we study the problem of automatic trailer generat...
Where to Play: Retrieval of Video Segments using Natural-Language Queries
In this paper, we propose a new approach for retrieval of video segments using natural language queries. Unlike most previous approaches such as concept-based methods or rule-based structured models, the proposed method uses an image captioning model to construct sentential queries for visual information. In detail, our approach exploits multiple captions generated by visual features in each image...
Locomotion language in the wild: Biomechanical constraints and caveats
Semantic systems vary substantially across languages, but may nonetheless be constrained by structure in the world. Malt et al. (2008) advanced such an argument in the domain of locomotion. In their study, speakers of English, Spanish, Japanese, and Belgian Dutch named video clips of an individual locomoting on a treadmill. In all four languages, naming respected the distinction between walking...
Models for automatic classification of video
In this paper, we explore a technique for automatic classification of video sequences (such as TV broadcasts and movies). This technique analyses the incoming video sequences and classifies them into categories. It can be viewed as an on-line parser for video signals. We present two techniques for automatic classification. In the first technique, the incoming video sequence is analyzed to extract the...
Journal: CoRR
Volume: abs/1609.01819
Publication date: 2016